SSML Summary

These speech markup elements are defined in SSML and are available in VoiceXML 2.0. The following table is a summary of the current SSML elements supported in this release of the Vocalocity Voice Browser.

Important: Support is largely driven by the text-to-speech engine. Consult the documentation provided by your TTS vendor for specific information about how SSML elements are supported.

Element

Purpose

Implementation Notes

<audio>

Play an audio clip within a prompt

Implemented as defined in SSML 1.0

The VoiceXML interpreter processes audio content so that only text is sent to the TTS engine; pre-recorded audio is played by the telephony hardware. For more information, see <audio> Element.

<break>

Specifies a pause in the speech output

Implemented as defined in SSML 1.0

<desc>

Provides a description of a non-speech audio source in <audio>.

Implemented as defined in SSML 1.0

<emphasis>

Specifies that the enclosed text should be spoken with emphasis

Implemented as defined in VoiceXML 2.0

Ignored by Speechify 2.1.6.

<lexicon>

Specifies a pronunciation lexicon for the prompt

Implemented as defined in SSML 1.0

<mark>

Place a marker into the text or tag sequence so that it can be referenced

Not supported

<meta>

Define a metadata item as a name/value pair

The following meta properties are supported:

n Expires (http-equiv)

n Pragma (http-equiv)

n Cache-Control (http-equiv)

For more information, see <meta> Element.

<metadata>

Define metadata information using a metadata schema

This element is supported, but not used by the Vocalocity Voice Browser.

<p>

Identifies the enclosed text as a paragraph, containing zero or more sentences

Implemented as defined in SSML 1.0

f<phoneme>

Specifies a phonetic pronunciation for the contained text

Implemented as defined in SSML 1.0

ph attribute is a required attribute that specifies the phoneme/phone string

Speechify 2.1.6 only supports ph attribute with SPR format. See Speechify Users Guide for more information.

<prosody>

Specifies prosodic information – control of the pitch, speaking rate, and volume of the speech output – for the enclosed text

Implemented as defined in SSML 1.0

Speechify 2.1.6 only supports volume and rate attributes.

<say-as>

Specifies the type of text construct contained within the element and helps specify the level of detail for rendering the contained text

Implemented as defined in SSML 1.0

Most types are supported by Speechify. See the Speechify Users Guide for a complete listing.

<s>

Identifies the enclosed text as a sentence

Implemented as defined in SSML 1.0

<sub>

Specifies replacement spoken text for the contained text

Implemented as defined in SSML 1.0

<voice>

Specifies voice characteristics for the spoken text.

Implemented as defined in SSML 1.0

Ignored by Speechify 2.1.6